100 + TFlop Solidification Simulations on BlueGene / L
نویسندگان
چکیده
We investigate solidification in tantalum and uranium systems ranging in size from 64,000 to 524,288,000 atoms on the IBM BlueGene/L computer at LLNL. Using the newly developed ddcMD code, we achieve performance rates as high as 103 TFlops, with a performance of 101.7 TFlop sustained over a 7 hour run on 131,072 cpus. We demonstrate superb strong and weak scaling. Our calculations are significant as they represent the first atomic-scale model of metal solidification to proceed, without finite size effects, from spontaneous nucleation and growth of solid out of the liquid, through the coalescence phase, and into the onset of coarsening. Thus, our simulations represent the first step towards an atomistic model of nucleation and growth that can directly link atomistic to mesoscopic length scales.
منابع مشابه
Emulating Ibm Bluegene on a Linux Mpi Cluster
The new and previous No. 1 is DOE’s IBM BlueGene/L system, installed at DOE’s Lawrence Livermore National Laboratory (LLNL). It has doubled in size (again) and has now achieved a record Linpack performance of 280.6 TFlop/s. It is still the only system ever to exceed the 100 TFlop/s mark. This project is being carried out at WAran Research FoundaTion as a part of WARFT’s major research initiativ...
متن کامل25 Tflop/s Multibillion-atom Molecular Dynamics Simulations and Visualization/analysis on Bluegene/l
We demonstrate the excellent performance and scalability of a classical molecular dynamics code, SPaSM, on the IBM BlueGene/L supercomputer at LLNL. Simulations involving up to 160 billion atoms (micron-size cubic samples) on 65,536 processors are reported, consistently achieving 24.4–25.5 Tflop/s for the commonly used Lennard-Jones 6-12 pairwise interaction potential. Two extended production s...
متن کاملAutomatically Tuned FFTs for BlueGene/L's Double FPU
IBM is currently developing the new line of BlueGene/L supercomputers. The top-of-the-line installation is planned to be a 65,536 processors system featuring a peak performance of 360 Tflop/s. This system is supposed to lead the Top 500 list when being installed in 2005 at the Lawrence Livermore National Laboratory. This paper presents one of the first numerical kernels run on a prototype BlueG...
متن کاملAutomatic Generation of the HPC Challenge's Global FFT Benchmark for BlueGene/P
We present the automatic synthesis of the HPC Challenge’s Global FFT, a large 1D FFT across a whole supercomputer system. We extend the Spiral system to synthesize specialized single-node FFT libraries that combine a data layout transformation with the actual on-node FFT computation to improve the network performance through enabling all-to-all collectives. We run our optimized Global FFT bench...
متن کاملEnabling Dual-Core Mode in BlueGene/L: Challenges and Solutions
BlueGene/L is a massively parallel computer system with 65,536 dual-processor compute nodes. The peak performance of BlueGene/L is in excess of 360 TFLOP/s if both processor cores in a node are used for computation. The main challenge of deploying this dual-core mode of operation is that the L1 caches in each core are not hardware coherent. This forces a software-based approach to cache coheren...
متن کامل